# Omni-modal Interaction
Qwen2.5 Omni 7B GPTQ Int4
Other
Qwen2.5-Omni is an end-to-end multimodal model capable of perceiving various modalities such as text, images, audio, and video, and generating text and natural speech responses in a streaming manner.
Multimodal Fusion
Transformers English

Q
Qwen
389
8
Qwen2.5 Omni 7B
Other
Qwen2.5-Omni is an end-to-end multimodal model capable of perceiving various modalities such as text, images, audio, and video, and generating text and natural speech responses in a streaming manner.
Multimodal Fusion
Transformers English

Q
Qwen
206.20k
1,522
Featured Recommended AI Models